Add torchao checkpoint tests #14074
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14074
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures, 1 Cancelled Job, 37 Pending as of commit 2f47f54 with merge base a90e907.
NEW FAILURES - The following jobs have failed:
CANCELLED JOB - The following job was cancelled. Please retry:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This PR needs a
```bash
case "$MODEL_NAME" in
  qwen3_4b)
    echo "Running Qwen3-4B export..."
    HF_MODEL_DIR=$(hf download metascroy/Qwen3-4B-INT8-INT4)
```
TODO: before landing, update the PyTorch checkpoint and change this to pytorch/Qwen3-4B-INT8-INT4
```bash
  phi_4_mini)
    echo "Running Phi-4-mini export..."
    HF_MODEL_DIR=$(hf download metascroy/Phi-4-mini-instruct-INT8-INT4)
```
TODO: before landing, update the PyTorch checkpoint and change this to pytorch/Phi-4-mini-instruct-INT8-INT4
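Since the same org swap is pending in both branches, here is a hedged sketch of one way to structure it; `HF_ORG` is an invented variable, not part of the PR, and defaulting it to the staging org would make the planned swap a one-line change:

```bash
# Sketch only: HF_ORG is hypothetical and not in the PR's diff.
HF_ORG="${HF_ORG:-metascroy}"   # TODO: flip default to "pytorch" before landing

case "$MODEL_NAME" in
  qwen3_4b)
    echo "Running Qwen3-4B export..."
    HF_MODEL_DIR=$(hf download "${HF_ORG}/Qwen3-4B-INT8-INT4")
    ;;
  phi_4_mini)
    echo "Running Phi-4-mini export..."
    HF_MODEL_DIR=$(hf download "${HF_ORG}/Phi-4-mini-instruct-INT8-INT4")
    ;;
esac
```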
```bash
cmake -DPYTHON_EXECUTABLE=python \
    -DCMAKE_INSTALL_PREFIX=cmake-out \
    -DEXECUTORCH_ENABLE_LOGGING=1 \
    -DCMAKE_BUILD_TYPE=Release \
    -DEXECUTORCH_BUILD_EXTENSION_DATA_LOADER=ON \
    -DEXECUTORCH_BUILD_EXTENSION_FLAT_TENSOR=ON \
    -DEXECUTORCH_BUILD_EXTENSION_MODULE=ON \
    -DEXECUTORCH_BUILD_EXTENSION_TENSOR=ON \
    -DEXECUTORCH_BUILD_XNNPACK=ON \
    -DEXECUTORCH_BUILD_KERNELS_QUANTIZED=ON \
    -DEXECUTORCH_BUILD_KERNELS_OPTIMIZED=ON \
    -DEXECUTORCH_BUILD_EXTENSION_LLM_RUNNER=ON \
    -DEXECUTORCH_BUILD_EXTENSION_LLM=ON \
    -DEXECUTORCH_BUILD_KERNELS_LLM=ON \
    -Bcmake-out .
cmake --build cmake-out -j16 --config Release --target install
```
shall we just test via the preset now?

```bash
cmake --preset llm -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=cmake-out
```
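For comparison, the full flow the suggestion implies would look roughly like the sketch below; this is an illustration of the proposal only, and as the reply notes, the preset route did not work in practice here:

```bash
# Configure via the llm preset instead of listing each EXECUTORCH_BUILD_* flag.
cmake --preset llm -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=cmake-out
# Build and install exactly as in the explicit-flags version.
cmake --build cmake-out -j16 --config Release --target install
```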
This doesn't seem to work; I'm reverting to not using the preset.
I filed an issue here: #14132
```diff
@@ -0,0 +1,139 @@
#!/usr/bin/env bash
set -euo pipefail
```
please use -x as well for scripts that will run in CI; it makes debugging easier
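As a sketch of that suggestion, the script's prologue would become:

```bash
#!/usr/bin/env bash
# -e : exit on the first failing command
# -u : treat unset variables as errors
# -x : trace each command to stderr before running it, so CI logs show exactly where a failure happened
# -o pipefail : a pipeline fails if any stage fails, not only the last one
set -euxo pipefail
```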
This PR adds new tests verifying that the pre-quantized model checkpoints we publish under the pytorch organization work with ExecuTorch (lowering and the C++ runner).
Qwen3-4B is tested for both lowering and runtime.
Phi-4-mini is tested for lowering only. There appears to be a regression in the C++ HF tokenizer used in ExecuTorch: it no longer works with the Phi-4-mini tokenizer. See #14077
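To make the test matrix concrete, here is a hypothetical invocation; the script path and the way `MODEL_NAME` is passed are assumptions, but the model identifiers come from the case statement in the diff:

```bash
# Hypothetical script path and calling convention; MODEL_NAME values are from the PR's diff.
MODEL_NAME=qwen3_4b   bash .ci/scripts/test_torchao_checkpoints.sh  # lowering + C++ runner
MODEL_NAME=phi_4_mini bash .ci/scripts/test_torchao_checkpoints.sh  # lowering only (see #14077)
```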